Abstract

Several equivalent formulations of the additivity conjecture for con- 
strained channels, which formally is substantially stronger than the 
unconstrained additivity, are given. To this end a characteristic prop- 
erty of the optimal ensemble for such a channel is derived, generalizing 
the maximal distance property. It is shown that the additivity conjec- 
ture for constrained channels holds true for certain nontrivial classes 
of channels. After giving an algebraic formulation of Shor's channel
extension, its main asymptotic property is proved. It is then used
to show that additivity for two constrained channels can be reduced 
to the same problem for unconstrained channels, and hence, "global" 
additivity for channels with arbitrary constraints is equivalent to ad- 
ditivity without constraints. 

Running title: Shor's channel extension and constrained channels 

1 Introduction 

In the recent paper [14] Shor gave arguments which show that the conjectured
additivity properties for several quantum information quantities, such as the
minimal output entropy, the Holevo capacity (in what follows, $\chi$-capacity)
and the entanglement of formation are in fact equivalent. An important
new tool in these arguments is the construction of a special extension $\hat\Phi$ for
an arbitrary channel $\Phi$ which has desired properties lacking for the initial
channel. In this paper we show that this extension allows us to deal with
the additivity conjecture for quantum channels with constrained inputs.
Introducing input constraints provides greater flexibility in the treatment of
the additivity conjecture. In a sense, Shor's channel extension plays the role of
the Lagrange function in optimization for the additivity questions. On the
other hand, while [14] deals with the "global" additivity, i.e. properties valid
for all possible channels, in this paper we emphasize results valid for
individual channels.

We start with giving several equivalent formulations of the additivity con- 
jecture for constrained channels (theorem 1), which formally is substantially 
stronger than the unconstrained additivity. To this end a characteristic prop- 
erty of the optimal ensemble for such a channel is derived (proposition 1), 
generalizing the maximal distance property [11]. It is shown that the
additivity conjecture for constrained channels holds true for certain nontrivial
classes of channels (proposition 2). After giving an algebraic formulation of
Shor's channel extension, its main property (proposition 3) is proved.
It is then used to show that additivity for two constrained channels can be 
reduced to the same problem for unconstrained channels, and hence, global 
additivity for channels with arbitrary constraints is equivalent to global ad- 
ditivity without constraints (theorem 2 and corollaries). Further results in 
this direction can be found in 0]. 



2 Basic quantities 

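Let $\mathcal{H}, \mathcal{H}'$ be finite dimensional Hilbert spaces and let
$\Phi : \mathfrak{S}(\mathcal{H}) \mapsto \mathfrak{S}(\mathcal{H}')$ be
a channel, where $\mathfrak{S}(\mathcal{H})$ denotes the set of states (density operators) in $\mathcal{H}$.
Let $\{\pi_i\}$ be a finite probability distribution and $\{\rho_i\}$ a collection of states in
$\mathfrak{S}(\mathcal{H})$, then the collection $\{\pi_i, \rho_i\}$ is called an ensemble, and
$\rho_{av} = \sum_i \pi_i \rho_i$ is its average.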

An important entropic characteristic of an ensemble is defined by

$$\chi_\Phi(\{\pi_i, \rho_i\}) = H\Big(\sum_i \pi_i \Phi(\rho_i)\Big) - \sum_i \pi_i H(\Phi(\rho_i)), \tag{1}$$

where $H(\cdot)$ is the von Neumann entropy. Following jH], we denote

$$\chi_\Phi(\rho) = \max_{\rho_{av} = \rho} \chi_\Phi(\{\pi_i, \rho_i\}).$$

Notice that

$$\chi_\Phi(\rho) = H(\Phi(\rho)) - \hat H_\Phi(\rho), \tag{2}$$

where

$$\hat H_\Phi(\rho) = \min_{\rho_{av} = \rho} \sum_i \pi_i H(\Phi(\rho_i)).$$

The function $\hat H_\Phi(\rho)$ is the convex closure [1] (or the convex roof, cf. [10])
of the output entropy $H(\Phi(\rho))$, which is a continuous concave function. The
function $\hat H_\Phi(\rho)$ is a natural generalization of the entanglement of formation
and coincides with it when the channel $\Phi$ is a partial trace. The continuity
of $\hat H_\Phi(\rho)$ follows from the MSW correspondence [U] and the continuity of the
entanglement of formation [7]. Thus the function $\chi_\Phi(\rho)$ (briefly, $\chi$-function)
is itself continuous and concave on $\mathfrak{S}(\mathcal{H})$.
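The quantity (1) can be evaluated numerically in a simple case. The following is a minimal sketch, assuming a qubit input and a depolarizing channel that shrinks the Bloch vector by a factor `lam`; the channel, the ensemble and all function names are illustrative assumptions and are not taken from the paper. It uses the fact that the qubit state $(I + r\cdot\sigma)/2$ has eigenvalues $(1 \pm |r|)/2$, so its entropy is a binary entropy.

```python
import math

def h2(x):
    """Binary entropy in bits; the term 0*log(0) is taken as 0."""
    return -sum(p * math.log2(p) for p in (x, 1.0 - x) if p > 0.0)

def qubit_entropy(r):
    """Von Neumann entropy of the qubit state (I + r.sigma)/2, |r| <= 1."""
    norm = math.sqrt(sum(c * c for c in r))
    return h2((1.0 + norm) / 2.0)

def depolarize(r, lam):
    """Assumed example channel: shrinks the Bloch vector by the factor lam."""
    return tuple(lam * c for c in r)

def chi(ensemble, lam):
    """Quantity (1) for an ensemble [(pi_i, r_i)] given by Bloch vectors."""
    avg = tuple(sum(p * r[k] for p, r in ensemble) for k in range(3))
    entropy_of_average = qubit_entropy(depolarize(avg, lam))
    average_entropy = sum(p * qubit_entropy(depolarize(r, lam)) for p, r in ensemble)
    return entropy_of_average - average_entropy

# Two antipodal pure states with equal weights; the average input is I/2.
ensemble = [(0.5, (0.0, 0.0, 1.0)), (0.5, (0.0, 0.0, -1.0))]
chi_value = chi(ensemble, lam=0.5)
```

For this ensemble the output of the average state is chaotic, so the first term in (1) equals one bit and `chi_value` equals `1 - h2(0.75)`, roughly 0.19 bits.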

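Consider the constraint on the ensemble $\{\pi_i, \rho_i\}$ defined by the requirement
$\rho_{av} \in \mathcal{A}$, where $\mathcal{A}$ is a closed subset of $\mathfrak{S}(\mathcal{H})$. A particular case is a linear
constraint, where the subset $\mathcal{A}^l$ is defined by the inequality $\mathrm{Tr} A\rho_{av} \le \alpha$ for
some positive operator $A$ and a number $\alpha \ge 0$. Define the $\chi$-capacity of the
$\mathcal{A}$-constrained channel $\Phi$ by

$$\bar C(\Phi; \mathcal{A}) = \max_{\rho \in \mathcal{A}} \chi_\Phi(\rho) = \max_{\rho_{av} \in \mathcal{A}} \chi_\Phi(\{\pi_i, \rho_i\}). \tag{3}$$

In the case of the linear constraint $\mathcal{A}^l$ we also use the notation $\bar C(\Phi; A, \alpha)$. Note
that the $\chi$-capacity for the unconstrained channel is $\bar C(\Phi) = \bar C(\Phi; \mathfrak{S}(\mathcal{H}))$.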

Lemma 1. For an arbitrary channel $\Phi : \mathfrak{S}(\mathcal{H}) \mapsto \mathfrak{S}(\mathcal{H}')$ and an arbitrary
density operator $\rho_0$ of full rank there exists a positive operator $A \le I_{\mathcal{H}}$ in
$\mathfrak{B}(\mathcal{H})$ such that $\rho_0$ is the maximum point of the function $\chi_\Phi(\rho)$ under the
condition $\mathrm{Tr} A\rho \le \alpha$, where $\alpha = \mathrm{Tr} A\rho_0$.

The statement of the lemma is intuitively clear, but its proof (see Appendix, I)
requires an argument from convex analysis due to the fact that
the function $\chi_\Phi(\rho)$ may not be smooth.

3 Optimal ensembles 

An ensemble $\{\pi_i, \rho_i\}$ on which the maximum in (3) is achieved is called an
optimal ensemble for the $\mathcal{A}$-constrained channel $\Phi$. The following proposition
generalizes the maximal distance property of optimal ensembles for
unconstrained channels [11].

Proposition 1. Let $\mathcal{A}$ be a closed convex set. The ensemble $\{\pi_i, \rho_i\}$ with
the average state $\rho_{av} \in \mathcal{A}$ is optimal for the $\mathcal{A}$-constrained channel $\Phi$ if and
only if

$$\sum_j \mu_j H(\Phi(\omega_j) \| \Phi(\rho_{av})) \le \chi_\Phi(\{\pi_i, \rho_i\})$$

for any ensemble $\{\mu_j, \omega_j\}$ with the average $\omega_{av} \in \mathcal{A}$, where $H(\cdot \| \cdot)$ is the
relative entropy.

Proof. The proof generalizes the argument in JT] by considering varia- 
tions of the initial ensemble involving not a single component but the whole 
ensemble. 

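Let $\{\pi_i, \rho_i\}_{i=1}^n$ and $\{\mu_j, \omega_j\}_{j=1}^m$ be two ensembles with the averages $\rho_{av}$ and
$\omega_{av}$ contained in $\mathcal{A}$. Consider the variation of the first ensemble by mixing it
with the second one with the weight coefficient $\eta$. The modified ensemble

$$\Sigma^\eta = \{(1-\eta)\pi_1, \rho_1;\ \dots;\ (1-\eta)\pi_n, \rho_n;\ \eta\mu_1, \omega_1;\ \dots;\ \eta\mu_m, \omega_m\}$$

has the average $\rho^\eta_{av} = (1-\eta)\rho_{av} + \eta\,\omega_{av} \in \mathcal{A}$ (by convexity). Using the relative
entropy expression for the quantity (1), we have

$$\chi_\Phi(\Sigma^\eta) = (1-\eta)\sum_{i=1}^n \pi_i H(\Phi(\rho_i) \| \Phi(\rho^\eta_{av})) + \eta \sum_{j=1}^m \mu_j H(\Phi(\omega_j) \| \Phi(\rho^\eta_{av})). \tag{4}$$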

Applying Donald's identity [11], [12] to the original ensemble we obtain

$$\sum_{i=1}^n \pi_i H(\Phi(\rho_i) \| \Phi(\rho^\eta_{av})) = \chi_\Phi(\Sigma^0) + H(\Phi(\rho_{av}) \| \Phi(\rho^\eta_{av})).$$

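Substitution of the above expression into (4) gives

$$\chi_\Phi(\Sigma^\eta) = \chi_\Phi(\Sigma^0) + (1-\eta) H(\Phi(\rho_{av}) \| \Phi(\rho^\eta_{av})) + \eta\Big[\sum_{j=1}^m \mu_j H(\Phi(\omega_j) \| \Phi(\rho^\eta_{av})) - \chi_\Phi(\Sigma^0)\Big]. \tag{5}$$

Applying Donald's identity to the modified ensemble we obtain

$$(1-\eta)\sum_{i=1}^n \pi_i H(\Phi(\rho_i) \| \Phi(\rho_{av})) + \eta\sum_{j=1}^m \mu_j H(\Phi(\omega_j) \| \Phi(\rho_{av})) = \chi_\Phi(\Sigma^\eta) + H(\Phi(\rho^\eta_{av}) \| \Phi(\rho_{av}))$$

and hence, since the first sum equals $\chi_\Phi(\Sigma^0)$,

$$\chi_\Phi(\Sigma^\eta) = \chi_\Phi(\Sigma^0) + \eta\Big[\sum_{j=1}^m \mu_j H(\Phi(\omega_j) \| \Phi(\rho_{av})) - \chi_\Phi(\Sigma^0)\Big] - H(\Phi(\rho^\eta_{av}) \| \Phi(\rho_{av})). \tag{6}$$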



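Since the relative entropy is nonnegative, the expressions (5) and (6) imply
the following inequalities for the quantity $\Delta\chi_\Phi = \chi_\Phi(\Sigma^\eta) - \chi_\Phi(\Sigma^0)$:

$$\eta\Big[\sum_{j=1}^m \mu_j H(\Phi(\omega_j) \| \Phi(\rho^\eta_{av})) - \chi_\Phi(\Sigma^0)\Big] \le \Delta\chi_\Phi \le \eta\Big[\sum_{j=1}^m \mu_j H(\Phi(\omega_j) \| \Phi(\rho_{av})) - \chi_\Phi(\Sigma^0)\Big]. \tag{7}$$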

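Now the proof of the proposition is straightforward. If

$$\sum_j \mu_j H(\Phi(\omega_j) \| \Phi(\rho_{av})) \le \chi_\Phi(\{\pi_i, \rho_i\})$$

for any ensemble $\{\mu_j, \omega_j\}$ of states in $\mathfrak{S}(\mathcal{H})$ with the average $\omega_{av} \in \mathcal{A}$,
then by the second inequality in (7) with $\eta = 1$ we have

$$\chi_\Phi(\{\mu_j, \omega_j\}) = \chi_\Phi(\Sigma^1) \le \chi_\Phi(\Sigma^0) = \chi_\Phi(\{\pi_i, \rho_i\}),$$

which means optimality of the ensemble $\{\pi_i, \rho_i\}$.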

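To prove the converse, suppose $\{\pi_i, \rho_i\}$ is an optimal ensemble and there
exists an ensemble $\{\mu_j, \omega_j\}$ such that

$$\sum_j \mu_j H(\Phi(\omega_j) \| \Phi(\rho_{av})) > \chi_\Phi(\{\pi_i, \rho_i\}).$$

By continuity of the relative entropy, there is $\eta > 0$ such that

$$\sum_j \mu_j H(\Phi(\omega_j) \| \Phi(\rho^\eta_{av})) > \chi_\Phi(\{\pi_i, \rho_i\}).$$

By the first inequality in (7), this means that $\chi_\Phi(\Sigma^\eta) > \chi_\Phi(\Sigma^0)$, in
contradiction with the optimality of the ensemble $\{\pi_i, \rho_i\}$. □

Corollary 1. Let $\rho_{av}$ be the average of an optimal ensemble for the
$\mathcal{A}$-constrained channel $\Phi$, then

$$\bar C(\Phi; \mathcal{A}) = \chi_\Phi(\rho_{av}) \ge \chi_\Phi(\rho) + H(\Phi(\rho) \| \Phi(\rho_{av})), \quad \forall \rho \in \mathcal{A}.$$

Proof. Let $\{\pi_i, \rho_i\}$ be an arbitrary ensemble such that $\sum_i \pi_i \rho_i = \rho \in \mathcal{A}$. By
proposition 1

$$\sum_i \pi_i H(\Phi(\rho_i) \| \Phi(\rho_{av})) \le \chi_\Phi(\rho_{av}).$$

This inequality and Donald's identity

$$\sum_i \pi_i H(\Phi(\rho_i) \| \Phi(\rho_{av})) = \chi_\Phi(\{\pi_i, \rho_i\}) + H(\Phi(\rho) \| \Phi(\rho_{av}))$$

complete the proof. □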
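Donald's identity, used repeatedly above, can be checked numerically in a small case. The sketch below assumes qubit states described by Bloch vectors and takes the channel to be the identity map, so that the identity reads $\sum_i \pi_i H(\rho_i \| \sigma) = \chi(\{\pi_i, \rho_i\}) + H(\rho_{av} \| \sigma)$ for a full rank reference state $\sigma$. The ensemble, the reference state and the function names are illustrative assumptions.

```python
import math

def norm(r):
    return math.sqrt(sum(c * c for c in r))

def entropy(r):
    """Von Neumann entropy (nats) of the qubit state (I + r.sigma)/2."""
    n = norm(r)
    return -sum(p * math.log(p) for p in ((1 + n) / 2, (1 - n) / 2) if p > 0)

def rel_entropy(r, s):
    """H(rho || sigma) for Bloch vectors r (rho) and s (sigma), 0 < |s| < 1."""
    ns = norm(s)
    log_plus, log_minus = math.log((1 + ns) / 2), math.log((1 - ns) / 2)
    overlap = sum(a * b for a, b in zip(r, s)) / ns  # r . s_hat
    # log(sigma) = a*I + b*(s_hat . sigma), hence Tr rho log sigma = a + b*(r . s_hat)
    tr_rho_log_sigma = (log_plus + log_minus) / 2 + overlap * (log_plus - log_minus) / 2
    return -entropy(r) - tr_rho_log_sigma

ensemble = [(0.2, (0.0, 0.0, 1.0)), (0.5, (0.3, 0.4, 0.0)), (0.3, (-0.5, 0.1, 0.2))]
sigma = (0.1, -0.2, 0.3)  # arbitrary full rank reference state

avg = tuple(sum(p * r[k] for p, r in ensemble) for k in range(3))
chi = entropy(avg) - sum(p * entropy(r) for p, r in ensemble)
lhs = sum(p * rel_entropy(r, sigma) for p, r in ensemble)
rhs = chi + rel_entropy(avg, sigma)
```

Up to rounding, `lhs` and `rhs` coincide for any choice of the ensemble and of the full rank reference state, which is the content of the identity.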

4 Additivity for constrained channels 

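Let $\Psi : \mathfrak{S}(\mathcal{K}) \mapsto \mathfrak{S}(\mathcal{K}')$ be another channel with the constraint defined by a
closed subset $\mathcal{B} \subset \mathfrak{S}(\mathcal{K})$. For the channel $\Phi \otimes \Psi$ we consider the constraint
defined by the requirements $\sigma^\Phi_{av} := \mathrm{Tr}_{\mathcal{K}} \sigma_{av} \in \mathcal{A}$ and $\sigma^\Psi_{av} := \mathrm{Tr}_{\mathcal{H}} \sigma_{av} \in \mathcal{B}$, where
$\sigma_{av}$ is the average state of an input ensemble $\{\mu_i, \sigma_i\}$. The closed subset of
$\mathfrak{S}(\mathcal{H} \otimes \mathcal{K})$ defined by the above requirements will be denoted $\mathcal{A} \otimes \mathcal{B}$.

We conjecture the following additivity property for constrained channels

$$\bar C(\Phi \otimes \Psi; \mathcal{A} \otimes \mathcal{B}) = \bar C(\Phi; \mathcal{A}) + \bar C(\Psi; \mathcal{B}). \tag{8}$$

The usual additivity conjecture for unconstrained channels is obtained by
setting $\mathcal{A} = \mathfrak{S}(\mathcal{H})$, $\mathcal{B} = \mathfrak{S}(\mathcal{K})$.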

Theorem 1. Let $\Phi$ and $\Psi$ be fixed channels. The following properties
are equivalent:

(i) equality (8) holds for arbitrary closed $\mathcal{A}$ and $\mathcal{B}$;

(ii) equality (8) holds for arbitrary linear constraints $\mathcal{A}^l$ and $\mathcal{B}^l$;

(iii) for arbitrary $\sigma \in \mathfrak{S}(\mathcal{H} \otimes \mathcal{K})$

$$\chi_{\Phi \otimes \Psi}(\sigma) \le \chi_\Phi(\sigma^\Phi) + \chi_\Psi(\sigma^\Psi); \tag{9}$$

(iv) for arbitrary $\sigma \in \mathfrak{S}(\mathcal{H} \otimes \mathcal{K})$

$$\hat H_{\Phi \otimes \Psi}(\sigma) \ge \hat H_\Phi(\sigma^\Phi) + \hat H_\Psi(\sigma^\Psi). \tag{10}$$

These are also equivalent to the corresponding additivity properties of $\chi_\Phi$
and $\hat H_\Phi$ for tensor product states. By using the MSW correspondence the
case of $\hat H_\Phi$ can be reduced to entanglement of formation, for which this was
established in [H], [TP].

Proof. (i) ⇒ (ii) is obvious; (ii) ⇒ (i) can be proved by double application
of the following lemma.

Lemma 2. The equality (8) holds for fixed closed $\mathcal{B}$ and arbitrary closed
$\mathcal{A}$ if it holds for the set $\mathcal{B}$ and arbitrary linear constraint $\mathcal{A}^l$, defined by the
inequality $\mathrm{Tr} A\rho \le \alpha$ with a positive operator $A$ and a number $\alpha$ such that
there exists a state $\rho'$ with $\mathrm{Tr} A\rho' < \alpha$.

Proof. Assume that the equality (8) holds for the set $\mathcal{B}$ and arbitrary set
$\mathcal{A}^l$, satisfying the above condition. It is sufficient to prove that

$$\chi_{\Phi \otimes \Psi}(\sigma) \le \chi_\Phi(\sigma^\Phi) + \bar C(\Psi; \mathcal{B}) \tag{11}$$

for any $\sigma \in \mathfrak{S}(\mathcal{H} \otimes \mathcal{K})$ such that $\sigma^\Psi \in \mathcal{B}$. Due to continuity of the $\chi$-function,
it is sufficient to prove (11) for a state $\sigma$ with partial trace $\sigma^\Phi$ of full rank.
For the state $\sigma^\Phi$ we can choose a positive operator $A$ in $\mathfrak{B}(\mathcal{H})$ in accordance
with lemma 1. Let $\mathcal{A}^l = \{\rho \in \mathfrak{S}(\mathcal{H}) \mid \mathrm{Tr} A\rho \le \alpha = \mathrm{Tr} A\sigma^\Phi\}$. The full rank of
$\sigma^\Phi$ guarantees the existence of a state $\rho'$ such that $\mathrm{Tr} A\rho' < \alpha = \mathrm{Tr} A\sigma^\Phi$. Let
$\omega$ be the average state of the optimal ensemble for the $\mathcal{B}$-constrained channel $\Psi$.
Due to the above assumption the state $\sigma^\Phi \otimes \omega$ is the average state of the
optimal ensemble for the $\mathcal{A}^l \otimes \mathcal{B}$-constrained channel $\Phi \otimes \Psi$. But it is clear that
this ensemble will also be optimal for the $\{\sigma^\Phi\} \otimes \mathcal{B}$-constrained channel $\Phi \otimes \Psi$
and, hence, (11) is true. □

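(i) ⇒ (iv). Fix the states $\rho$ and $\omega$ and take $\mathcal{A} = \{\rho\}$, $\mathcal{B} = \{\omega\}$, then (8)
becomes

$$\bar C(\Phi \otimes \Psi; \{\rho\} \otimes \{\omega\}) = \bar C(\Phi; \{\rho\}) + \bar C(\Psi; \{\omega\}). \tag{12}$$

This implies existence of an unentangled ensemble with the average $\rho \otimes \omega$, which
is optimal for the $\{\rho\} \otimes \{\omega\}$-constrained channel $\Phi \otimes \Psi$. By corollary 1 we
have

$$\chi_{\Phi \otimes \Psi}(\rho \otimes \omega) = \chi_\Phi(\rho) + \chi_\Psi(\omega) \ge \chi_{\Phi \otimes \Psi}(\sigma) + H((\Phi \otimes \Psi)(\sigma) \| \Phi(\rho) \otimes \Psi(\omega)) \tag{13}$$

for any state $\sigma \in \mathfrak{S}(\mathcal{H} \otimes \mathcal{K})$ such that $\sigma^\Phi = \rho$ and $\sigma^\Psi = \omega$. Note that

$$H((\Phi \otimes \Psi)(\sigma) \| \Phi(\rho) \otimes \Psi(\omega)) = H(\Phi(\rho)) + H(\Psi(\omega)) - H((\Phi \otimes \Psi)(\sigma)). \tag{14}$$

The inequality (13) together with (14) and (2) implies (10).

(iv) ⇒ (iii) obviously follows from the definition of the $\chi$-function and
subadditivity of the (output) entropy.

(iii) ⇒ (i). From the definition of the $\chi$-capacity and (9)

$$\bar C(\Phi \otimes \Psi; \mathcal{A} \otimes \mathcal{B}) \le \bar C(\Phi; \mathcal{A}) + \bar C(\Psi; \mathcal{B}).$$

Since the converse inequality is obvious, there is equality here. □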

Remark 1. The additivity of the $\chi$-capacity for arbitrarily constrained
channels is formally substantially stronger than the usual unconstrained addi- 
tivity. Indeed, the latter holds trivially for channels that are (unconstrained) 
partial traces, but the additivity for constrained partial traces, by the MSW 
correspondence, would imply validity of the global additivity conjecture. 

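The following proposition implies that the set of quantum channels
satisfying the properties in theorem 1 is nonempty. We shall use the following
obvious statement.

Lemma 3. Let $\{\Phi_j\}_{j=1}^n$ be a collection of channels from $\mathfrak{S}(\mathcal{H})$ into
$\mathfrak{S}(\mathcal{H}'_j)$, and let $\{q_j\}_{j=1}^n$ be a probability distribution. Then for the channel
$\Phi = \bigoplus_{j=1}^n q_j \Phi_j$ from $\mathfrak{S}(\mathcal{H})$ into $\mathfrak{S}(\bigoplus_{j=1}^n \mathcal{H}'_j)$ one has

$$\chi_\Phi(\{\rho_i, \pi_i\}) = \sum_{j=1}^n q_j \chi_{\Phi_j}(\{\rho_i, \pi_i\}). \quad \square$$

We shall call $\Phi$ the direct sum mixture of the channels $\{\Phi_j\}_{j=1}^n$.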
Proposition 2. Let $\Psi$ be an arbitrary channel. The inequality (9) holds
in each of the following cases:

(i) $\Phi$ is a noiseless channel;

(ii) $\Phi$ is an entanglement breaking channel;

(iii) $\Phi$ is a direct sum mixture of a noiseless channel and a channel $\Phi_0$
such that (9) holds for $\Phi_0$ and $\Psi$ (in particular, an entanglement breaking
channel).

An obvious example of a channel of the type (iii) is the erasure channel.
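Lemma 3 can be illustrated on the qubit erasure channel, viewed as the direct sum mixture of the noiseless channel (weight $1-p$) and a channel sending every input to a fixed flag state (weight $p$). The sketch below is only an illustration under these assumptions; the ensemble, the value of `p` and the function names are hypothetical. The output state is block diagonal, so its spectrum is the union of the rescaled spectra of the blocks.

```python
import math

def shannon(probs):
    """Entropy (bits) of a list of eigenvalues; zero eigenvalues are skipped."""
    return -sum(p * math.log2(p) for p in probs if p > 0.0)

def qubit_spectrum(r):
    """Eigenvalues of the qubit state (I + r.sigma)/2."""
    n = math.sqrt(sum(c * c for c in r))
    return [(1.0 + n) / 2.0, (1.0 - n) / 2.0]

def erasure_spectrum(r, p):
    """Eigenvalues of the block-diagonal output (1-p)*rho (+) p*|e><e|."""
    return [(1.0 - p) * x for x in qubit_spectrum(r)] + [p]

def chi(ensemble, spectrum):
    """Entropy of the average output minus the average output entropy."""
    avg = tuple(sum(w * r[k] for w, r in ensemble) for k in range(3))
    return shannon(spectrum(avg)) - sum(w * shannon(spectrum(r)) for w, r in ensemble)

p = 0.3
ensemble = [(0.3, (0.0, 0.0, 1.0)), (0.7, (0.6, 0.0, 0.0))]
chi_erasure = chi(ensemble, lambda r: erasure_spectrum(r, p))
chi_noiseless = chi(ensemble, qubit_spectrum)
```

The flag branch has a constant output and contributes nothing, so `chi_erasure` agrees with `(1 - p) * chi_noiseless`, as the lemma predicts for this mixture.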

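Proof. (i) The proof is a modification of the proof in [3] of the "unconstrained"
additivity for two channels with one of them noiseless, based on
the Groenevold-Lindblad-Ozawa inequality [H]

$$H(\sigma) \ge \sum_j p_j H(\sigma_j), \tag{15}$$

where $\sigma$ is a state of a quantum system before a von Neumann measurement,
$\sigma_j$ is the posterior state with the outcome $j$ and $p_j$ is the probability of this
outcome.

Let $\Phi = \mathrm{Id}$ be the noiseless channel and let $\rho$ be an arbitrary state in
$\mathfrak{S}(\mathcal{H})$. We want to prove that

$$\bar C(\mathrm{Id} \otimes \Psi; \{\rho\} \otimes \{\omega\}) = \bar C(\mathrm{Id}; \{\rho\}) + \bar C(\Psi; \{\omega\}) = H(\rho) + \chi_\Psi(\omega). \tag{16}$$

Let $\{\mu_i, \sigma_i\}$ be an ensemble of states in $\mathfrak{S}(\mathcal{H} \otimes \mathcal{K})$ with $\sum_i \mu_i \sigma_i^\Phi = \rho$, $\sum_i \mu_i \sigma_i^\Psi = \omega$.
By subadditivity of quantum entropy

$$\chi_{\mathrm{Id} \otimes \Psi}(\{\mu_i, \sigma_i\}) = H\Big(\mathrm{Id} \otimes \Psi\Big(\sum_i \mu_i \sigma_i\Big)\Big) - \sum_i \mu_i H(\mathrm{Id} \otimes \Psi(\sigma_i)) \le H(\rho) + H(\Psi(\omega)) - \sum_i \mu_i H(\mathrm{Id} \otimes \Psi(\sigma_i)). \tag{17}$$

Consider the measurement defined by the observable $\{|e_j\rangle\langle e_j| \otimes I_{\mathcal{K}}\}$, where
$\{|e_j\rangle\}$ is an orthonormal basis in $\mathcal{H}$. By (15) we obtain

$$H(\mathrm{Id} \otimes \Psi(\sigma_i)) \ge \sum_j p_{ij} H(\Psi(\sigma^\Psi_{ij})), \quad \text{for all } i,$$

where $p_{ij} = \langle e_j | \sigma_i^\Phi | e_j \rangle$ and $\sigma_{ij} = p_{ij}^{-1}\, (|e_j\rangle\langle e_j| \otimes I_{\mathcal{K}}) \cdot \sigma_i \cdot (|e_j\rangle\langle e_j| \otimes I_{\mathcal{K}})$. Note that
$\sum_j p_{ij} \sigma^\Psi_{ij} = \sigma_i^\Psi$ and $\sum_{ij} \mu_i p_{ij} \sigma^\Psi_{ij} = \omega$. This and the previous inequality show
that the two last terms in (17) do not exceed $\chi_\Psi(\{\mu_i p_{ij}, \sigma^\Psi_{ij}\})$ and hence $\chi_\Psi(\omega)$.
With this observation (17) implies (16) and hence the proof is complete.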

(ii) See [13] where the additivity conjecture for two unconstrained channels
with one of them entanglement breaking was proved. In the proof
of this theorem the subadditivity property of the $\chi$-function was in fact
established. We can also deduce the subadditivity of the $\chi$-function from the
unconstrained additivity with the help of corollary 2 (see Sec. 5 below). One
should only verify that the entanglement breaking property of a channel implies
a similar property of Shor's extension for that channel.

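(iii) Let $\Phi_q = q\,\mathrm{Id} \oplus (1-q)\Phi_0$. For an arbitrary channel $\Psi$ we have
$\Phi_q \otimes \Psi = q(\mathrm{Id} \otimes \Psi) \oplus (1-q)(\Phi_0 \otimes \Psi)$. By using lemma 3 and subadditivity
of the functions $\chi_{\mathrm{Id} \otimes \Psi}$ and $\chi_{\Phi_0 \otimes \Psi}$ we obtain

$$\chi_{\Phi_q \otimes \Psi}(\sigma) \le q\chi_{\mathrm{Id} \otimes \Psi}(\sigma) + (1-q)\chi_{\Phi_0 \otimes \Psi}(\sigma)$$

$$\le q\chi_{\mathrm{Id}}(\sigma^\Phi) + q\chi_\Psi(\sigma^\Psi) + (1-q)\chi_{\Phi_0}(\sigma^\Phi) + (1-q)\chi_\Psi(\sigma^\Psi)$$

$$= qH(\sigma^\Phi) + (1-q)\chi_{\Phi_0}(\sigma^\Phi) + \chi_\Psi(\sigma^\Psi) = \chi_{\Phi_q}(\sigma^\Phi) + \chi_\Psi(\sigma^\Psi),$$

where the last equality follows from the existence of a pure state ensemble
on which the maximum in the definition of $\chi_{\Phi_0}(\sigma^\Phi)$ is achieved. □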

5 Shor's channel extension 

Let $\Phi$ be a channel from $\mathfrak{S}(\mathcal{H})$ to $\mathfrak{S}(\mathcal{H}')$, and let $E$ be an operator in
$\mathfrak{B}(\mathcal{H})$, $0 \le E \le I$. Let $q \in [0; 1]$ and $d \in \mathbb{N} = \{1, 2, \dots\}$. Shor's channel
extension $\hat\Phi$ with probability $1-q$ acts as the channel $\Phi$ and with probability
$q$ makes a measurement in $\mathcal{H}$ with the outcomes $\{0, 1\}$ corresponding to the
resolution of the identity $\{E^\perp, E\}$, where we denote $E^\perp = I - E$. If the
outcome is 1, then $\log d$ classical bits are sent to the receiver, otherwise a
failure signal is sent [14]. Later $q$ will tend to zero while $d$ tends to infinity, such that
$q \log d = \lambda$ will be constant. The channel $\hat\Phi$ will then mostly act on input
states $\rho$ as $\Phi$, at the same time rarely sending a lot of classical information
at the rate proportional to the value $\mathrm{Tr} \rho E$, which to some extent explains its
relation to the capacity of the channel $\Phi$ with constrained inputs to be explored
in this section.

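Translating the definition into algebraic language, consider the following
channel $\hat\Phi(E, q, d)$, which maps states on $\mathfrak{B}(\mathcal{H}) \otimes \mathbf{C}^d$ into states on $\mathfrak{B}(\mathcal{H}') \oplus \mathbf{C}^{d+1}$,
where $\mathbf{C}^d$ is the commutative algebra of complex $d$-dimensional vectors
describing a classical system. By using the isomorphism of $\mathfrak{B}(\mathcal{H}) \otimes \mathbf{C}^d$ with
the direct sum of $d$ copies of $\mathfrak{B}(\mathcal{H})$, any state in $\mathfrak{B}(\mathcal{H}) \otimes \mathbf{C}^d$ can be represented
as an array $\{\rho_j\}_{j=1}^d$ of positive operators in $\mathfrak{B}(\mathcal{H})$ such that $\mathrm{Tr} \sum_{j=1}^d \rho_j = 1$.
The action of the channel $\hat\Phi(E, q, d)$ on the state $\hat\rho = \{\rho_j\}_{j=1}^d$ with $\rho = \sum_{j=1}^d \rho_j$
is defined by

$$\hat\Phi(E, q, d)(\hat\rho) = (1-q)\Phi_0(\hat\rho) \oplus q\Phi_1(\hat\rho),$$

where $\Phi_0(\hat\rho) = \Phi(\rho) \in \mathfrak{S}(\mathcal{H}')$ and $\Phi_1(\hat\rho) = [\mathrm{Tr} \rho E^\perp, \mathrm{Tr} \rho_1 E, \dots, \mathrm{Tr} \rho_d E] \in \mathbf{C}^{d+1}$.
Note that $\Phi_0$ and $\Phi_1$ are channels from $\mathfrak{B}(\mathcal{H}) \otimes \mathbf{C}^d$ to $\mathfrak{B}(\mathcal{H}')$ and to $\mathbf{C}^{d+1}$
correspondingly. The input state space of the channel $\hat\Phi(E, q, d)$ will be
denoted $\mathfrak{S}_d$.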
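The role of the classical branch $\Phi_1$ can be seen in a small computation. The sketch below is an illustration under simplifying assumptions: a single input state $\rho$ with $x = \mathrm{Tr} \rho E$, placed with equal probabilities $1/d$ in each of the $d$ positions of the array, and no second channel. The function names are hypothetical. In this special case the quantity (1) for $\Phi_1$ equals $x \log d$.

```python
import math

def shannon(probs):
    """Shannon entropy (bits); zero entries are skipped."""
    return -sum(p * math.log2(p) for p in probs if p > 0.0)

def phi1(traces_E, trace_E_perp):
    """Classical branch output [Tr rho E_perp, Tr rho_1 E, ..., Tr rho_d E]."""
    return [trace_E_perp] + list(traces_E)

def chi_phi1_uniform(x, d):
    """chi of Phi_1 on the ensemble of d arrays, each holding rho in one position
    (weight 1/d each), where x = Tr rho E."""
    outputs = [
        phi1([x if k == j else 0.0 for k in range(d)], 1.0 - x)
        for j in range(d)
    ]
    average = [sum(column) / d for column in zip(*outputs)]
    return shannon(average) - sum(shannon(out) for out in outputs) / d

value = chi_phi1_uniform(0.25, 16)  # x * log2(d) = 0.25 * 4
```

Multiplying this value by $q = \lambda / \log d$ gives $\lambda\, \mathrm{Tr} \rho E$, which indicates how the extension adds a term linear in $\rho$ to the $\chi$-function of $\Phi$ when $q \log d = \lambda$ is kept constant.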

Remark 2. More precisely, since in this paper channel means a map
defined on the algebra of all operators in the input Hilbert space, the action
of $\hat\Phi(E, q, d)$ should be extended correspondingly. Then $\mathbf{C}^d$ is considered as
the algebra of diagonal matrices acting in the $d$-dimensional Hilbert space $\mathcal{H}_d$,
and the input algebra of the channel is $\mathfrak{B}(\mathcal{H}) \otimes \mathbf{C}^d \subset \mathfrak{B}(\mathcal{H} \otimes \mathcal{H}_d)$, while the
output algebra is $\mathfrak{B}(\mathcal{H}') \oplus \mathbf{C}^{d+1} \subset \mathfrak{B}(\mathcal{H}' \oplus \mathcal{H}_{d+1})$. The action of $\hat\Phi(E, q, d)$ can
then be naturally extended to the whole of $\mathfrak{B}(\mathcal{H} \otimes \mathcal{H}_d)$ by letting $\hat\Phi$ vanish
on the elements $A \otimes B$, where $A \in \mathfrak{B}(\mathcal{H})$ and $B$ is any matrix with zeroes on
the diagonal, acting in $\mathcal{H}_d$. This is described in [14] by saying that the first
action of $\hat\Phi(E, q, d)$ is to make a measurement in the canonical basis of $\mathcal{H}_d$.

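Proposition 3. Let $\Psi : \mathfrak{S}(\mathcal{K}) \mapsto \mathfrak{S}(\mathcal{K}')$ be an arbitrary $\mathcal{B}$-constrained
channel. Consider the channel $\hat\Phi(E, q, d) \otimes \Psi$. Then

$$\bar C(\hat\Phi(E, q, d) \otimes \Psi; \mathfrak{S}_d \otimes \mathcal{B}) - \max_{\sigma: \sigma^\Psi \in \mathcal{B}} \big[(1-q)\chi_{\Phi \otimes \Psi}(\sigma) + q \log d\, \mathrm{Tr}\, \sigma(E \otimes I_{\mathcal{K}})\big] \le q(\log \dim \mathcal{K}' + 1).$$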

Proof. Due to the representation

$$\hat\Phi(E, q, d) \otimes \Psi = (1-q)(\Phi_0 \otimes \Psi) \oplus q(\Phi_1 \otimes \Psi), \tag{18}$$

lemma 3 reduces the calculation of the quantity $\chi_{\hat\Phi(E,q,d) \otimes \Psi}$ for any ensemble
of input states to the calculation of the quantities $\chi_{\Phi_0 \otimes \Psi}$ and $\chi_{\Phi_1 \otimes \Psi}$ for this
ensemble.

Note that any state $\hat\sigma$ in $\mathfrak{B}(\mathcal{H}) \otimes \mathbf{C}^d \otimes \mathfrak{B}(\mathcal{K})$ can be represented as an
array $\{\sigma_j\}_{j=1}^d$ of positive operators in $\mathfrak{B}(\mathcal{H} \otimes \mathcal{K})$ such that $\mathrm{Tr} \sum_{j=1}^d \sigma_j = 1$.
Denote by $\delta_j(\sigma)$ the array $\hat\sigma$ with the state $\sigma$ in the $j$-th position and with
zeroes in other places.

It is known that for any channel there exists a pure state optimal
ensemble [11] and that the image of the average state of any optimal
ensemble is the same (this follows from corollary 1). These facts and
symmetry arguments imply existence of an optimal ensemble for the channel
$\hat\Phi(E, q, d) \otimes \Psi$ consisting of the states $\hat\sigma_{i,j} = \delta_j(\sigma_i)$ with the probabilities
$\hat\mu_{i,j} = d^{-1}\mu_i$, where $\{\mu_i, \sigma_i\}$ is an ensemble of states in $\mathfrak{S}(\mathcal{H} \otimes \mathcal{K})$ (cf. [14]).
Let $\hat\sigma_{av} = \sum_{i,j} \hat\mu_{i,j} \hat\sigma_{i,j}$ and $\sigma_{av} = \sum_i \mu_i \sigma_i$ be the averages of these ensembles.
Note that $\hat\sigma_{av} = [d^{-1}\sigma_{av}, \dots, d^{-1}\sigma_{av}]$.

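The action of the channel $\Phi_0 \otimes \Psi$ on the state $\hat\sigma = [\sigma_j]_{j=1}^d$ with $\sigma = \sum_{j=1}^d \sigma_j$
is

$$\Phi_0 \otimes \Psi(\hat\sigma) = \Phi \otimes \Psi(\sigma).$$

Hence $\Phi_0 \otimes \Psi(\hat\sigma_{i,j}) = \Phi \otimes \Psi(\sigma_i)$ and

$$\chi_{\Phi_0 \otimes \Psi}(\{\hat\mu_{i,j}, \hat\sigma_{i,j}\}) = \chi_{\Phi \otimes \Psi}(\{\mu_i, \sigma_i\}). \tag{19}$$

Let us prove that

$$\chi_{\Phi_1 \otimes \Psi}(\{\hat\mu_{i,j}, \hat\sigma_{i,j}\}) = \log d\, \mathrm{Tr}\, \sigma_{av}(E \otimes I_{\mathcal{K}}) + f^E_\Psi(\{\mu_i, \sigma_i\}), \tag{20}$$

where $0 \le f^E_\Psi(\{\mu_i, \sigma_i\}) \le \log \dim \mathcal{K}' + 1$. It is easy to see that the action of
the channel $\Phi_1 \otimes \Psi$ on the state $\hat\sigma = [\sigma_j]_{j=1}^d$ with $\sigma = \sum_{j=1}^d \sigma_j$ is

$$\Phi_1 \otimes \Psi(\hat\sigma) = [\Psi_{E^\perp}(\sigma), \Psi_E(\sigma_1), \dots, \Psi_E(\sigma_d)],$$

where $\Psi_A(\cdot) = \mathrm{Tr}_{\mathcal{H}} (A \otimes I_{\mathcal{K}})(\mathrm{Id} \otimes \Psi)(\cdot)$ is a completely positive trace-nonincreasing
map from $\mathfrak{B}(\mathcal{H} \otimes \mathcal{K})$ into $\mathfrak{B}(\mathcal{K}')$ ($A = E, E^\perp$, and $\mathrm{Id}$ is the identity map
on $\mathfrak{B}(\mathcal{H})$).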
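Therefore,

$$H(\Phi_1 \otimes \Psi(\hat\sigma_{i,j})) = H(\Psi_E(\sigma_i)) + H(\Psi_{E^\perp}(\sigma_i)), \tag{21}$$

and

$$\Phi_1 \otimes \Psi(\hat\sigma_{av}) = [\Psi_{E^\perp}(\sigma_{av}), d^{-1}\Psi_E(\sigma_{av}), \dots, d^{-1}\Psi_E(\sigma_{av})].$$

Due to this

$$H(\Phi_1 \otimes \Psi(\hat\sigma_{av})) = \log d\, \mathrm{Tr}\, \Psi_E(\sigma_{av}) + H(\Psi_E(\sigma_{av})) + H(\Psi_{E^\perp}(\sigma_{av})). \tag{22}$$

Using (21), (22) and $\mathrm{Tr}\, \Psi_E(\sigma) = \mathrm{Tr}\, \sigma(E \otimes I_{\mathcal{K}})$, we obtain

$$\chi_{\Phi_1 \otimes \Psi}(\{\hat\mu_{i,j}, \hat\sigma_{i,j}\}) = \log d\, \mathrm{Tr}\, \sigma_{av}(E \otimes I_{\mathcal{K}}) + H(\Psi_E(\sigma_{av})) + H(\Psi_{E^\perp}(\sigma_{av})) - \sum_i \mu_i \big(H(\Psi_E(\sigma_i)) + H(\Psi_{E^\perp}(\sigma_i))\big) \tag{23}$$

$$= \log d\, \mathrm{Tr}\, \sigma_{av}(E \otimes I_{\mathcal{K}}) + \chi_{\Psi_E}(\{\mu_i, \sigma_i\}) + \chi_{\Psi_{E^\perp}}(\{\mu_i, \sigma_i\}).$$

Using the inequalities $0 \le H(S) \le \mathrm{Tr} S (\log \dim \mathcal{H} - \log \mathrm{Tr} S)$ for any positive
operator $S \in \mathfrak{B}(\mathcal{H})$ with $\mathrm{Tr} S \le 1$, and $h_2(x) = -x \log x - (1-x) \log(1-x) \le 1$, it is possible
to show that

$$f^E_\Psi(\{\mu_i, \sigma_i\}) := \chi_{\Psi_E}(\{\mu_i, \sigma_i\}) + \chi_{\Psi_{E^\perp}}(\{\mu_i, \sigma_i\}) \le \log \dim \mathcal{K}' + 1, \tag{24}$$

hence we obtain (20).

Lemma 3 with (19) and (20) imply

$$\chi_{\hat\Phi(E,q,d) \otimes \Psi}(\{\hat\mu_{i,j}, \hat\sigma_{i,j}\}) = (1-q)\chi_{\Phi_0 \otimes \Psi}(\{\hat\mu_{i,j}, \hat\sigma_{i,j}\}) + q\chi_{\Phi_1 \otimes \Psi}(\{\hat\mu_{i,j}, \hat\sigma_{i,j}\})$$

$$= (1-q)\chi_{\Phi \otimes \Psi}(\{\mu_i, \sigma_i\}) + q \log d\, \mathrm{Tr}\, \sigma_{av}(E \otimes I_{\mathcal{K}}) + q f^E_\Psi(\{\mu_i, \sigma_i\}).$$

The last equality with (24) completes the proof. □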

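Theorem 2. Let $\Phi : \mathfrak{S}(\mathcal{H}) \mapsto \mathfrak{S}(\mathcal{H}')$ and $\Psi : \mathfrak{S}(\mathcal{K}) \mapsto \mathfrak{S}(\mathcal{K}')$ be
arbitrary channels with the fixed constraint on the second one defined by a
closed set $\mathcal{B}$. The following statements are equivalent:

(i) The additivity (8) holds for the $\mathcal{A}$-constrained channel $\Phi$ with arbitrary
closed $\mathcal{A} \subset \mathfrak{S}(\mathcal{H})$ and the $\mathcal{B}$-constrained channel $\Psi$;

(ii) The additivity holds asymptotically for the sequence of the channels
$\{\hat\Phi(E, \lambda/\log d, d)\}_{d \in \mathbb{N}}$ with arbitrary operator $0 \le E \le I$ and arbitrary
nonnegative number $\lambda$ (without constraints) and the $\mathcal{B}$-constrained channel $\Psi$,
in the sense that

$$\lim_{d \to +\infty} \bar C(\hat\Phi(E, \lambda/\log d, d) \otimes \Psi; \mathfrak{S}_d \otimes \mathcal{B}) = \lim_{d \to +\infty} \bar C(\hat\Phi(E, \lambda/\log d, d)) + \bar C(\Psi; \mathcal{B}).$$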

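Proof. Note, first of all, that for an operator $0 \le E \le I$ and a number
$\lambda \ge 0$ proposition 3 implies

$$\lim_{d \to +\infty} \bar C(\hat\Phi(E, \lambda/\log d, d)) = \max_\rho \big[\chi_\Phi(\rho) + \lambda \mathrm{Tr} \rho E\big] \tag{25}$$

and

$$\lim_{d \to +\infty} \bar C(\hat\Phi(E, \lambda/\log d, d) \otimes \Psi; \mathfrak{S}_d \otimes \mathcal{B}) = \max_{\sigma: \sigma^\Psi \in \mathcal{B}} \big[\chi_{\Phi \otimes \Psi}(\sigma) + \lambda \mathrm{Tr}\, \sigma(E \otimes I_{\mathcal{K}})\big] \tag{26}$$

correspondingly.

Begin with (i) ⇒ (ii). Let $\sigma_*$ be a maximum point in the right side of
(26) and $\alpha = \mathrm{Tr}\, \sigma_*(E \otimes I_{\mathcal{K}})$. By the statement (i) the additivity holds for
the channel $\Phi$ with the constraint $\mathrm{Tr} \rho E^\perp \le 1 - \alpha$ and the $\mathcal{B}$-constrained
channel $\Psi$. So there exist states $\rho$ and $\omega \in \mathcal{B}$ such that $\mathrm{Tr} \rho E \ge \alpha$ and
$\chi_\Phi(\rho) + \chi_\Psi(\omega) \ge \chi_{\Phi \otimes \Psi}(\sigma_*)$. Hence

$$\max_{\sigma: \sigma^\Psi \in \mathcal{B}} \big[\chi_{\Phi \otimes \Psi}(\sigma) + \lambda \mathrm{Tr}\, \sigma(E \otimes I_{\mathcal{K}})\big] = \chi_{\Phi \otimes \Psi}(\sigma_*) + \lambda \mathrm{Tr}\, \sigma_*(E \otimes I_{\mathcal{K}})$$

$$\le \chi_\Phi(\rho) + \chi_\Psi(\omega) + \lambda \mathrm{Tr} \rho E \le \max_\rho \big[\chi_\Phi(\rho) + \lambda \mathrm{Tr} \rho E\big] + \bar C(\Psi; \mathcal{B}).$$

Due to (25) and (26) this means that

$$\lim_{d \to +\infty} \bar C(\hat\Phi(E, \lambda/\log d, d) \otimes \Psi; \mathfrak{S}_d \otimes \mathcal{B}) \le \lim_{d \to +\infty} \bar C(\hat\Phi(E, \lambda/\log d, d)) + \bar C(\Psi; \mathcal{B}),$$

which implies (ii).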

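The proof of (ii) ⇒ (i) is based on lemma 2. Let $\mathcal{A}^l$ be a set defined by
the inequality $\mathrm{Tr} \rho A \le \alpha$ with an operator $0 \le A \le I$ and a positive number
$\alpha$ such that there exists a state $\rho'$ with $\mathrm{Tr} \rho' A < \alpha$. Due to lemma 2 it is
sufficient to show that

$$\bar C(\Phi \otimes \Psi; \mathcal{A}^l \otimes \mathcal{B}) \le \bar C(\Phi; \mathcal{A}^l) + \bar C(\Psi; \mathcal{B}), \tag{27}$$

that is, for all ensembles $\{\mu_i, \sigma_i\}$ in $\mathfrak{S}(\mathcal{H} \otimes \mathcal{K})$ with the average $\sigma_{av}$, such
that $\mathrm{Tr}\, \sigma^\Phi_{av} A \le \alpha$, $\sigma^\Psi_{av} \in \mathcal{B}$,

$$\chi_{\Phi \otimes \Psi}(\sigma_{av}) \le \bar C(\Phi; \mathcal{A}^l) + \bar C(\Psi; \mathcal{B}). \tag{28}$$

Let $\rho_{av}$ be the average state of the optimal ensemble for the $\mathcal{A}^l$-constrained
channel $\Phi$ so that $\bar C(\Phi; \mathcal{A}^l) = \chi_\Phi(\rho_{av})$. Note that the state $\rho_{av}$ is the point of
maximum of the concave function $\chi_\Phi(\rho)$ with the constraint $\mathrm{Tr} \rho A \le \alpha$. By
the Kuhn-Tucker theorem (we use the strong version of this theorem with
the Slater condition, which follows from the existence of a state $\rho'$ such that
$\mathrm{Tr} \rho' A < \alpha$) jS], there exists a nonnegative number $\lambda$, such that $\rho_{av}$ is the
point of the global maximum of the function $\chi_\Phi(\rho) - \lambda \mathrm{Tr} \rho A$ and the following
condition holds

$$\lambda(\mathrm{Tr} A\rho_{av} - \alpha) = 0. \tag{29}$$

It is clear that $\rho_{av}$ is also the point of the global maximum of the concave
function $\chi_\Phi(\rho) + \lambda \mathrm{Tr} \rho E$, where $E = I - A$, so that

$$\chi_\Phi(\rho) + \lambda \mathrm{Tr} \rho E \le \chi_\Phi(\rho_{av}) + \lambda \mathrm{Tr} \rho_{av} E, \quad \forall \rho \in \mathfrak{S}(\mathcal{H}). \tag{30}$$

Consider the sequence $\hat\Phi(E, \lambda/\log d, d)$. The assumed asymptotic additivity
together with (25) and (26) implies

$$\max_{\sigma: \sigma^\Psi \in \mathcal{B}} \big[\chi_{\Phi \otimes \Psi}(\sigma) + \lambda \mathrm{Tr}\, \sigma(E \otimes I_{\mathcal{K}})\big] = \max_\rho \big[\chi_\Phi(\rho) + \lambda \mathrm{Tr} \rho E\big] + \bar C(\Psi; \mathcal{B}). \tag{31}$$

Due to (29) and (30) we have

$$\max_\rho \big[\chi_\Phi(\rho) + \lambda \mathrm{Tr} \rho E\big] = \chi_\Phi(\rho_{av}) + \lambda \mathrm{Tr} \rho_{av}(I - A) = \bar C(\Phi; \mathcal{A}^l) + \lambda(1 - \alpha). \tag{32}$$

Hence

$$\chi_{\Phi \otimes \Psi}(\sigma_{av}) + \lambda \mathrm{Tr}\, \sigma_{av}(E \otimes I_{\mathcal{K}}) \le \bar C(\Phi; \mathcal{A}^l) + \bar C(\Psi; \mathcal{B}) + \lambda(1 - \alpha).$$

Noting that

$$\mathrm{Tr}\, \sigma_{av}(E \otimes I_{\mathcal{K}}) = \mathrm{Tr}\, \sigma^\Phi_{av}(I - A) \ge 1 - \alpha,$$

we obtain (28), and hence (ii) ⇒ (i). □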

Corollary 2. The additivity of the $\chi$-capacity for Shor's channel extensions
$\hat\Phi(E, q, d)$ and $\hat\Psi(F, r, e)$ with arbitrary triples $(E, q, d)$ and $(F, r, e)$
implies its additivity for the $\mathcal{A}$-constrained channel $\Phi$ and the $\mathcal{B}$-constrained
channel $\Psi$ with arbitrary $\mathcal{A} \subset \mathfrak{S}(\mathcal{H})$ and $\mathcal{B} \subset \mathfrak{S}(\mathcal{K})$.

Proof. This is obtained by double application of theorem 2. □ 

Corollary 3. If the additivity holds for any two unconstrained channels 
then it holds for any two channels with arbitrary constraints. 

Remark 3. The statement of the corollary 3 could be also deduced by 
combining results of [Hj and [0] , but we gave a direct proof here. 

6 Additive constraints 

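Let $A$ be a positive operator in $\mathcal{H}$, and let

$$A^{(n)} = A \otimes \cdots \otimes I_{\mathcal{H}} + \cdots + I_{\mathcal{H}} \otimes \cdots \otimes A$$

be the corresponding operator in $\mathcal{H}^{\otimes n}$. The classical capacity of the channel
$\Phi$ with inputs subject to the additive constraint

$$\mathrm{Tr} \rho^{(n)} A^{(n)} \le n\alpha; \quad n = 1, 2, \dots$$

is shown [2] to be equal to

$$C(\Phi; A, \alpha) = \lim_{n \to \infty} \bar C(\Phi^{\otimes n}; A^{(n)}, n\alpha)/n.$$

In [6] the following weak additivity property was considered:

$$\bar C(\Phi \otimes \Psi; A \otimes I_{\mathcal{K}} + I_{\mathcal{H}} \otimes B, \gamma) = \max_{\alpha + \beta = \gamma} \big[\bar C(\Phi; A, \alpha) + \bar C(\Psi; B, \beta)\big], \tag{33}$$

where $\Phi$ and $\Psi$ are channels with the input spaces $\mathcal{H}$ and $\mathcal{K}$, and the
corresponding linear constraints $\mathrm{Tr} \rho A \le \alpha$ and $\mathrm{Tr} \rho B \le \beta$. It is easy to see that
the additivity for the two constrained channels in the sense (8) implies the
weak additivity (33). The extension of the latter to $n$ channels implies

$$\bar C(\Phi^{\otimes n}; A^{(n)}, n\alpha) = n \bar C(\Phi; A, \alpha),$$

and hence the equality $C(\Phi; A, \alpha) = \bar C(\Phi; A, \alpha)$. Indeed, the function $f(\alpha) =
\bar C(\Phi; A, \alpha)$ defined by (3) is nondecreasing and concave (see Appendix, II),
whence

$$\max_{\alpha_1 + \dots + \alpha_n = n\alpha} \big[f(\alpha_1) + \dots + f(\alpha_n)\big]$$

is achieved for $\alpha_1 = \dots = \alpha_n = \alpha$.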
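The last step is an instance of Jensen's inequality: for a concave function the sum over a split of a fixed total is largest at the equal split. The following brute-force sketch illustrates this on a grid; the function `f` is a hypothetical concave nondecreasing stand-in for $\alpha \mapsto \bar C(\Phi; A, \alpha)$ and is not derived from any channel, and the grid search is only an illustration, not a method used in the paper.

```python
import math
from itertools import product

def f(alpha):
    """Hypothetical concave nondecreasing stand-in for the constrained capacity."""
    return math.log2(1.0 + alpha)

def best_split(total, n, steps):
    """Grid search for the split (a_1, ..., a_n) of `total` maximizing sum f(a_i)."""
    grid = [total * k / steps for k in range(steps + 1)]
    best_value, best_point = -math.inf, None
    for head in product(grid, repeat=n - 1):
        last = total - sum(head)      # the last share is fixed by the constraint
        if last < 0.0:
            continue
        value = sum(f(a) for a in head) + f(last)
        if value > best_value:
            best_value, best_point = value, head + (last,)
    return best_value, best_point

# n = 3 copies with the average constraint value alpha = 0.5
value, point = best_split(total=1.5, n=3, steps=30)
```

On this grid the maximum is attained at the equal split `(0.5, 0.5, 0.5)` with the value `3 * f(0.5)`.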

The weak additivity conjecture for constrained channels becomes equiv- 
alent to the additivity conjecture in the sense of this paper when this weak 
additivity holds true for any two channels. Indeed, the latter implies global 
additivity for channels without constraints, from which global additivity for 
constrained channels follows by corollary 3. 

Needless to say, however, that in applications constraints usually arise 
when the channel space is infinite-dimensional and the constraint operators 
are unbounded. The finite dimensionality (implying boundedness of the
constraint operators) is crucial in this paper, and relaxing this restriction is
both an interesting and a nontrivial problem.



7 Appendix

I. The main property underlying the proof of lemma 1 is the concavity
of the function $\chi_\Phi(\rho)$ on $\mathfrak{S}(\mathcal{H})$. This function may not be smooth, therefore
we will use non-smooth convex analysis arguments instead of calculations
with derivatives.

Consider the Banach space $\mathfrak{B}_h(\mathcal{H})$ of all Hermitian operators on $\mathcal{H}$ and
the concave extension $\hat\chi_\Phi$ of the function $\chi_\Phi$ to $\mathfrak{B}_h(\mathcal{H})$, defined by:

$$\hat\chi_\Phi(\rho) = \begin{cases} [\mathrm{Tr} \rho]\, \chi_\Phi([\mathrm{Tr} \rho]^{-1} \rho), & \rho \in \mathfrak{B}_+(\mathcal{H}), \\ -\infty, & \rho \in \mathfrak{B}_h(\mathcal{H}) \setminus \mathfrak{B}_+(\mathcal{H}), \end{cases}$$

where $\mathfrak{B}_+(\mathcal{H})$ is the convex cone of positive operators in $\mathcal{H}$. The function $\hat\chi_\Phi$
is bounded in a neighborhood of any internal point of $\mathfrak{B}_+(\mathcal{H})$ (and, hence,
by the concavity it is continuous at all internal points of $\mathfrak{B}_+(\mathcal{H})$, which are
nondegenerate positive operators, see j^j, 3.2.3).

By the assumption $\rho_0$ is an internal point of the cone $\mathfrak{B}_+(\mathcal{H})$. Hence,
the convex function $-\hat\chi_\Phi$ is continuous at $\rho_0$. Due to the continuity, the
subdifferential of the convex function $-\hat\chi_\Phi$ at the point $\rho_0$ is not empty
(see jS], 4.2.1). This means that there exists a linear function $l(\rho)$ such
that $\rho_0$ is the minimum point of the function $-\hat\chi_\Phi(\rho) - l(\rho)$. Any linear
function on $\mathfrak{B}_h(\mathcal{H})$ has the form $l(\rho) = \mathrm{Tr} A\rho$ for some $A \in \mathfrak{B}_h(\mathcal{H})$.
Hence, $\rho_0$ is also the minimum point of the function $-\chi_\Phi(\rho)$ under the
conditions $\mathrm{Tr} A\rho = \alpha = \mathrm{Tr} A\rho_0$ and $\mathrm{Tr} \rho = 1$. Introduce the operator
$A' = \frac{1}{2}[\|A\|^{-1} A + I]$ and the number $\alpha' = \frac{1}{2}[\|A\|^{-1} \alpha + 1]$. The linear variety defined
by the conditions $\mathrm{Tr} \rho A = \alpha$ and $\mathrm{Tr} \rho = 1$ coincides with that defined by the
conditions $\mathrm{Tr} A'\rho = \alpha'$ and $\mathrm{Tr} \rho = 1$. Therefore, $\rho_0$ is the minimum point
of the function $-\chi_\Phi(\rho)$ under the conditions $\mathrm{Tr} A'\rho = \alpha'$ and $\mathrm{Tr} \rho = 1$, and,
hence, $\rho_0$ is the maximum point of the function $\chi_\Phi(\rho)$ under the condition
$\mathrm{Tr} A'\rho = \alpha'$. By concavity of the function $\chi_\Phi(\rho)$ this implies that $\rho_0$ is the
maximum point of the function $\chi_\Phi(\rho)$ under the condition either $\mathrm{Tr} A'\rho \le \alpha'$
or $\mathrm{Tr} A'\rho \ge \alpha'$ (see n. II below). By noting that $0 \le A' \le I$ and setting $A$
and $\alpha$ to be equal to $A'$ and $\alpha'$ in the first case and to $I - A'$ and $1 - \alpha'$ in
the second, we complete the proof of lemma 1.

II. If $F(x)$ is a concave continuous function and $l(x)$ is a linear function
on a compact convex subset of a finite dimensional vector space, then the
function

$$f(\alpha) = \max_{x: l(x) = \alpha} F(x)$$

is concave. Indeed, assume $f(\alpha)$ is not, then there exist $\alpha_1, \alpha_2$ such that
$f(\frac{\alpha_1 + \alpha_2}{2}) < \frac{1}{2}[f(\alpha_1) + f(\alpha_2)]$. Let $x_i$ be points at which the maxima are
achieved, i.e. $l(x_i) = \alpha_i$ and $f(\alpha_i) = F(x_i)$, then $l(\frac{x_1 + x_2}{2}) = \frac{\alpha_1 + \alpha_2}{2}$ and
$F(\frac{x_1 + x_2}{2}) \le f(\frac{\alpha_1 + \alpha_2}{2}) < \frac{1}{2}[F(x_1) + F(x_2)]$, which contradicts the concavity
of $F$. A similar argument applies to the functions $f_+(\alpha) = \max_{x: l(x) \le \alpha} F(x)$ and
$f_-(\alpha) = \max_{x: l(x) \ge \alpha} F(x)$, which are thus also concave.

With the same definitions one has either $f(\alpha) = f_+(\alpha)$ or $f(\alpha) = f_-(\alpha)$,
for otherwise there exist $x_1, x_2$ such that

$$l(x_1) < \alpha; \quad F(x_1) > f(\alpha); \quad l(x_2) > \alpha; \quad F(x_2) > f(\alpha).$$

Then taking $\lambda = \frac{l(x_2) - \alpha}{l(x_2) - l(x_1)}$ one has $0 < \lambda < 1$, $l(\lambda x_1 + (1 - \lambda)x_2) = \alpha$ and

$$F(\lambda x_1 + (1 - \lambda)x_2) \le f(\alpha) < \lambda F(x_1) + (1 - \lambda)F(x_2),$$

contradicting the concavity of $F$.